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DETAILED ACTION 
Claim Rejections - 35 USC § 102 

1 . The following is a quotation of the appropriate paragraphs of 35 
U.S.C. 102 that form the basis for the rejections under this section made in this 
Office action: 

A person shall be entitled to a patent unless - 

(e) the invention was described in (1) an application for patent, published under section 
122(b), by another filed in the United States before the invention by the applicant for patent or 
(2) a patent granted on an application for patent by another filed in the United States before 
the invention by the applicant for patent, except that an international application filed under 
the treaty defined in section 351(a) shall have the effects for purposes of this subsection of an 
application filed in the United States only if the international application designated the United 
States and was published under Article 21(2) of such treaty in the English language. 

2. Claims 1, 4-7, and 10-12 are rejected under 35 U.S.C. 102(e) as being 
anticipated by Shriberg et al. (Speech Communication Publication). 

3. Regarding claims 1 and 11-12, Shriberg et al. disclose a method, a 
computer usable medium having computer readable program code, and a digital 
audio processing system for the segmentation of an audio stream into semantic 
or syntactic units wherein the audio stream is provided in a digitized format, 
comprising the steps of: 

determining a fundamental frequency for the digitized audio stream 
(Section 2.1.2.3 on page 133)\ 

detecting changes of the fundamental frequency in the audio stream 
(pages 134-135, refer to figure 4)\ 
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determining candidate boundaries for the semantic or syntactic units 
depending on the detected changes of the fundamental frequency (pages 134- 
135); 

extracting at least one prosodic feature in the neighborhood of the 
candidate boundaries {pages 130-131); 

determining boundaries for the semantic or syntactic units depending on 
the at least one prosodic feature (pages 134-135, F0 is a prosodic feature). 

4. Regarding claims 4-7 and 10, Shriberg et al. further disclose a method for 
extracting at least one prosodic feature in an environment of the audio stream 
where the value of the index function is equal 0 (section 2. 1. 1 on page 130 
discusses feature extraction of both voice and unvoiced portions), that the 
environment is a time period between 500 and 4000 milliseconds (Section 2.1.1 
on page 130), at least one prosodic feature is represented by the fundamental 
frequency (Section 2.1.1, page 130), the extracting step involves extracting at 
least two prosodic features and combining the at least two prosodic features 
(Section 2. 1.2, page 131), and a step of performing a prosodic feature 
classification based on a predetermined classification tree (section 2.1.2 on page 
131, grouping features). 

Claim Rejections - 35 USC § 103 

5. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for 
all obviousness rejections set forth in this Office action: 
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(a) A patent may not be obtained though the invention is not identically disclosed or described 
as set forth in section 1 02 of this title, if the differences between the subject matter sought to 
be patented and the prior art are such that the subject matter as a whole would have been 
obvious at the time the invention was made to a person having ordinary skill in the art to which 
said subject matter pertains. Patentability shall not be negatived by the manner in which the 
invention was made. 

6. Claims 2-3, 8, and 1 3 are rejected under 35 U.S.C. 1 03(a) as being 
unpatentable over Shriberg et al. (Speech Communication Publication) in view of 
Yeldener et al. (US Patent No. 5774837). 

7. Regarding claims 2-3 and 13, Shriberg et al. do not disclose a method for 
providing a threshold value for the voicedness of the fundamental frequency 
estimates and determining whether the voicedness of fundamental frequency 
estimates is lower than the threshold value, and for defining an index function for 
the fundamental frequency having a value =0 if the voicedness of the 
fundamental frequency is lower than the threshold value and having a value =1 if 
the voicedness of the fundamental frequency is higher than the threshold value. 

However, Yeldener et al. teach a method for providing a threshold value 
for the voicedness of the fundamental frequency estimates and determining 
whether the voicedness of fundamental frequency estimates is lower than the 
threshold value (col. 15, In. 1 to col. 16, 1. 14), and for defining an index function 
for the fundamental frequency having a value =0 if the voicedness of the 
fundamental frequency is lower than the threshold value and having a value =1 if 
the voicedness of the fundamental frequency is higher than the threshold value 
(col. 14, In. 4-55, the goal is to use 0 and 1 to represent for unvoiced and voice 
portions, respectively). 
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Since Shriberg et al. and Yeldener et al. are analogous art because they 
are from the same field of endeavors, it would have been obvious to one of 
ordinary skill in the art at the time of invention to modify Shriberg et al. by 
incorporating the teaching of Yeldener et al. in order to enable the system to pay 
more coding emphasis on the voice portion than unvoiced portion to reduce 
processing time and increase transmission rate. 

8. Regarding claim 8, Shriberg et al. do not disclose a method that first 
detect speech and non-speech segments in the digitized audio stream and 
performing the steps of claim 1 thereafter only for detected speech segments. 
However, Yeldener et al. teach a method that first detect speech and non-speech 
segments in the digitized audio stream and performing the steps of claim 1 
thereafter only for detected speech segments (col. 14, In. 5-55). 

Since Shriberg et al. and Yeldener et al. are analogous art because they 
are from the same field of endeavors, it would have been obvious to one of 
ordinary skill in the art at the time of invention to modify Shriberg et al. by 
incorporating the teaching of Yeldener et al. in order to enable the system to pay 
more coding emphasis on the voice portion than unvoiced portion to reduce 
processing time and increase transmission rate. 

9. Claims 9 and 14 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Shriberg et al. (Speech Communication Publication) in view of 
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Yeldener et al. (US Patent No. 5774837), as applied to claims 8 and 13 above, 
and further in view of Eryilmaz (US Patent No. 5867574). 

10. Regarding claims 9 and 14, the modified Shriberg et al., as applied to 
claims 8 and 13 above, fail to disclose a method of detecting of speech and non- 
speech segments comprises utilizing the signal energy or signal energy changes, 
respectively, in the audio stream. However, Eryilmaz teaches a method of 
detecting of speech and non-speech segments comprises utilizing the signal 
energy or signal energy changes, respectively, in the audio stream (coL 3, In. 40 
to col. 4, In. 54). 

Since the modified Shriberg et al. and Eryilmaz are analogous art because 
they are from the same field of endeavors, it would have been obvious to one of 
ordinary skill in the art at the time of invention to further modify Shriberg et al. by 
incorporating the teaching of Eryilmaz in order to enhance the detection of voice 
portion in the signal when background noise is present. 

Conclusion 

Any inquiry concerning this communication or earlier communications from 
the examiner should be directed to Huyen Vo whose telephone number is 703- 
305-8665. The examiner can normally be reached on M-F, 9-5:30. 

If attempts to reach the examiner by telephone are unsuccessful, the 
examiner's supervisor, Doris To can be reached on 703-305-4827. The fax 
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phone number for the organization where this application or proceeding is 
assigned is 703-872-9306. 

Information regarding the status of an application may be obtained from 
the Patent Application Information Retrieval (PAIR) system. Status information 
for published applications may be obtained from either Private PAIR or Public 
PAIR. Status information for unpublished applications is available through 
Private PAIR only. For more information about the PAIR system, see http://pair- 
direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll- 
free). 

Examiner Huyen X. Vo. July 28, 2004 





